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PRDIT-SPECIPIC 
TRANSCRIPTIONAL FACTORS 

5 

CROSS-REFERENCE TO RELATED APPLICATIONS 
This application is a continuation-in-part of 
Application Serial No. 168,190, filed March 15, 1988, 
which is a continuation-in-part of Application Serial 
10 No. 054,369 filed May 26, 1987, which applications are 
incorporated herein by reference. 

INTRODUCTION 

Technical Field 
.15 This invention relates to DNA expression cas- 

settes capable of directing fruit-specific expression 
of in vitro constructed expression cassettes in plants. 
The invention is exemplified by promoters useful in 
fruit-specific transcription in a tomato plant. 

20 

Background 

Manipulation of plants has proven to be 
significantly more difficult than manipulation of 
prokaryotes and mammalian hosts. As compared to 

25 prokaryotes and mammalian cells, much less was known 
about the biochemistry and cell biology of plant cells 
and plants. The ability to transform plant cells and 
regenerate plants is unique to flora since other 
differentiated species provide readily available trans- 

30 formable germ cells which may be fertilized and intro- 
duced into the live host for fetal development to a 
mature fetus« There has been substantial interest in 
modifying the ovum with inducible transcriptonal initi- 
ation regions to afford inducible transcription and 

35 expression of the gene introduced into the ovum, rather 
than constituitive expression which would result in ex- 
pression throughout the fetus. 
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Also, for plantsr it is frequently desirable 
to be able to control expression at a particular stage 
in the growth of the plant or in a particular plant 
part. During the various stages of the growth of the 
5 plant, and as to the various components of the plant, 
it will frequently be desirable to direct the effect of 
the construct introduced into the entire plant or a 
particular part and/or to a particular stage of differ- 
entiation of the plant cell. For this purpose, regula- 

10 tory sequences are required which afford the desired 
initiation of transcription in the appropriate cell 
types and/or at the appropriate time in the plant de- 
velopment, without having serious detrimental effects 
on the plant development and productivity. 

15 It is therefore important to be able to iso- 

late sequences which can be manipulated to provide the 
desired regulation of transcription in a plant cell 
host during the growing cycle of the plant. One aspect 
of this interest is the ability to change the phenctype 

20 of fruit, so as to provide fruit which will have im- 
proved aspects for storage, handling, cooking, organ- 
oleptic properties, freezing, nutritional value, and 
the like. 

25 Relevant Literature 

cDNA clones from tomato displaying differen- 
tial expression during fruit development have been iso- 
lated and characterized (Mansson et al., Mol. Gen. 
Genet. (1985) 200:356-361; Slater et al. , Plant Mol. 

30 Biol. (1985) 5:137-147). The studies have focused pri- 
marily on mRNAs which accumulate during fruit ripening. 
One of the proteins encoded by the ripening-specific 
cDNAs has been identified as polygalacturonase (Slater 
et al.. Plant Mol. Biol. (1985) 5:137-147). A cDNA 

35 clone which encodes tomato polygalacturonase has been 
sequenced. Grierson et al.. Nucleic Acids Research 
(1986) 14:8395-8603. The concentration of polygalac- 
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turonase mRNA increases 2000*£old between the immature- 
green and red-ripe stages of fruit development. This 
suggests that expression of the enzyme is regulated by 
the specific mRNA concentration which in turn is regu- 
5 lated by an increase in transcription. Delia Penna et 
al., Proc. Natl. Acad. Sci. PSA (1986) 83:6420-6424. 
Nature plastid mRNA for psbA (one of the components of 
photosystem II) reaches its highest level late in fruit 
development/ whereas after the onset of ripening, plas- 

10 tid mRNAs for other components of photosystem I and II 
decline to nondetectable levels in chromoplasts . 
Piechulla et al.r Plant Mol. Biol. (1986) 7:367-376. 

Other studies have focused on cONAs encoding 
genes under inducible regulation, e.g. proteinase in- 

15 hibitors which are expressed in response to wounding in 
tomato (Graham et al.r J. Biol. Chem. (1985) 260 :6555- 
6560? Graham et al.r J. Biol. Chem. (1985) 260:6561- 
6564) and on mRNAs correlated with ethylene synthesis 
in ripening fruit and leaves after wounding. Smith et 

20 al., Planta (1986) 168:94-100^ 

Leaf disc transformation of cultivated tomato 
is described by McCormick, et al. , Plant Cell Reports 
(1986) 5:81-89. 

25 SUMMARY OF THE INVENTION 

Novel ONA constructions are provided employing 
a "fruit-specific promoter, particularly those active 
beginning at or shortly after anthesis or beginning at 
the breaker stage / joined to a DNA sequence of interest 

30 and a transcriptional termination region. A DNA 

construct may be introduced into a plant cell host for 
integration into the genome and transcription regulated 
at a time at or subsequent to anthesis. In this 
manner, high levels of RNA and, as appropriate, 

35 polypeptides, may be achieved during formation and/or 
ripening of fruit. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 showc the nucleotide sequence of the 
cDNA clones pCGN1299 (2A11) and pCGN1298 (3H11). The 
amino acid sequence o£ the polypeptide encoded by the 
5 open reading £raroe is also indicated. 

Figure 2 is a comparison of 2A11 to pea stor- 
age proteins and other abundant storage proteins: 

(a) 2A11 (residues 33-46) is compared 
to PAlb and the reactive site sequences of some prote- 

10 ase inhibitors^ PAlb (residues 6-23)/ chick pea inhibi- 
tor (residues 11-23)/ lima bean inhibitor (residues 
23-35)/ human al-antitrypsin reactive site peptide. 
The arrow indicates the reactive site. 

(b) is a comparison of the amino ter- 
15 minal sequence of 2A11 with the amino termini of a 

range of seed proteins. The data have been modified or 
deletions introduced to maximize homology; conserved 
residues are shown boxed. The sequences are from the 
following sources: PAlb; barley cMoroform/methanol- 

20 soluble protein d; wheat albumin; wheat o-amylase 

inhibitor 0.28; millet bi-functional inhibitor; castor 
bean 2S small subunit; and napin small subunit. 

Figure 3 is a schematic diagram of the con- 
struction of the binary plasmid pCGN783; (a) through 

25 (f) refer to the plasmid constructions in Example 6.1. 

Figure 4 shows the complete sequence of the 
2A11 genomic DNA cloned into pCGN1273 from the Xho l 
site (position 1 at the 5* end) to the EcoRI site 
(position 4654). 

30 Figure 5 shovs the nucleotide sequence of a 

polygalacturonase (PG) genomic clone. 

DESCRIPTION OF THE SPECIFIC EMBODIMENTS 
In accordance with the subject invention, DNA 
35 constructs are provided which allow for modification of 
plant phenotype during fruit maturation and ripening. 
The DNA constructs provide for a regulated transcrip- 
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tional initiation region associated with fruit develop- 
ment and ripening. Downstream from and under the tran- 
scriptional initiation regulation of the fruit related 
initiation region will be a sequence of interest which 
5 will provide for modification of the phenotype of the 
fruit. Desirably, integration constructs may be pre- 
pared which allow for integration of the transcrip- 
tional cassette into the genome of a plant host. Con- 
veniently r the vector may include a multiple cloning 
10 site downstream from the fruit related transcriptional 
initiation region, so that the integration construct 
may be employed for a variety of sequences in an 
efficient manner. 

Of particular interest is a transcriptional 
15 initiation region which is activated at dr shortly 

after anthesisr so that in the early development of the 
fruit, it provides the desired level of transcription 
of the sequence of interest. Normally, the sequence of 
interest will be involved in affecting the process in 
20 the early formation of the fruit or providing a proper- 
ty which is desirable during the growing (expansion) 
period of the fruit, or at or after harvesting. 

The ripening stages of the tomato may be 
broken down into mature green, breaker, turning, pink, 
25 light red and red. Desirably, the transcriptional 
initiation region maintains its activity during the 
expansion and maturation of the green fruit, more de- 
sirably continues active through the ripening or red 
fruit period. Comparable periods for other fruit are 
30 referred to as stages of ripening. The invention is 

not limited to those transcriptional initiation regions 
which are activated at or shortly after anthesis but 
also includes transcriptional initiation regions which 
are activated at any of the ripening stages of the 
35 fruit. 
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The transcriptional initiation region may be 
native or homologous to the host or foreign or heterol- 
ogous to the host. By foreign is intended that the 
transcriptional initiation region is not found in the 
5 wild-type host into which the transcriptional initia- 
tion region is introduced. Of particular interest is a 
tomato fruit-specific transcriptional initiation region 
referred to as 2A11 which regulates the expression of a 
2A11 cDNA sequence described in the Experimental sec- 

10 tion. The 2A11 transcriptional initiation region pro- 
vides for an abundant messenger, being activated at or 
shortly after anthesis and remaining active until the 
red fruit stage. The expressed protein is a sulfur- 
rich protein similar to other plant storage proteins in 

15 sulfur content and size. Also of interest is the tran- 
scriptional initiation region which regulates expres- 
sion of the enzyme polygalacturonase, an enzyme which 
plays an important role in fruit ripening. The poly- 
galacutonase promoter is active in at least the breaker 

20 through red fruit stage. 

Other fruit-specific promoters may be acti- 
vated at times subsequent to anthesis, such as prior to 
or during the green fruit stage, during pre-ripe (e.g., 
breaker) or even into the red fruit stage. 

25 A transcriptional initiation region may be em- 

ployed for varying the phenotype of the fruit. Various 
changes in phenotype are of interest. These changes 
may include up- or down-regulation of formation of a 
particular saccharide, involving mono- or polysacchar- 

30 ides, involving such enzymes as polygalacturonase, 

levansucrase, dextransucrase, invertase, etc.; enhanced 
lycopene biosynthesis; cytokinin and monellin synthesis. 
Other properties of interest for modification include 
response to stress, organisms, herbicides, bruising, 

35 mechanical agitation, etc,^ change in growth regula- 
tors, organoleptic properties, etc. For antisense or 
complementary sequence transcription, the sequence will 



usually be at least 12, more usually at least 16 nt. 
Antisense sequences of interest include those of poly- 
galacturonase / sucrase synthase and invertase. 

The transcriptional cassette will include in 
the 5 '-3* direction of transcription, a transcriptional 
and translational initiation region, a sequence of in- 
terest, and a transcriptional and transI^^Monal termi- 
nation region functional in plants. One or more in- 
trons may be also be present. The DNA sequence may 
have any open reading frame encoding a peptide of 
interest, e.g. an enzyme, or a sequence complementary 
to a genomic sequence, where the genomic sequence may 
be an open reading frame, an intron, a non-coding 
leader sequence, or any other sequence where the com- 
plementary sequence will inhibit transcription, mes- 
senger RNA processing, e.g. splicing, or translation. 
The DNA sequence of interest may be synthetic, natur- 
ally derived, or combinations thereof. Depending upon 
the nature of the ONA sequence of interest, it may be 
desirable to synthesize the sequence with plant pre- 
ferred codons. The plant preferred codons may be de- 
termined from the codons of highest frequency in the 
proteins expressed in the largest amount in the partic- 
ular plant species of interest* 

In preparing the transcription cassette, the 
various DNA fragments may be manipulated, so as to pro- 
vide for the DNA sequences in the proper orientation 
and, as appropriate, in the proper reading frame. To- 
ward this end, adapters or linkers may be employed for 
joining the DNA fragments or other manipulations may be 
involved to provide for convenient restriction sites, 
removal of superfluous DNA, removal of restriction 
sites, or the like. Toward this end, in vitro muta- 
genesis, primer repair, restriction, annealing, resec- 
tion, ligation, or the like may be employed, where in- 
sertions, deletions or substitutions, e.g. transitions 
and transversions, may be involved. 
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The termination region which is employed will 
tZT ' -venience, since the ^erLnat o 

terlnaM''"' T "'^^'^^'^ interchangeable. The 
i tioli " "^^'^^ transcrip. 

tional initiation region, may be native with the DNA 
sequence of interest, or may be derived from another 
Trir.: '"""""'""^ termination regions a.e available 
from the Ti-plasmid of A. tumefaciens . such as the oc- 
topxne synthase and nopaline synthase termination 
regions. 

By appropriate manipulations, such as restric- 
tion. Chewing back or tilling in overhangs to provide 

ln"t"r"''; " " e- 

mentary ends oe the fragments can be provided for join- 
ing and ligation. ' 

emnlo. „ 'lonins 
employed, so as to amplify the amount of DNA and to al- 
low for analyzing the DNA to ensure that the operations 
have occurred in proper manner, a wide variety of 
Cloning vectors are available, where the cloning vector 
includes a replication system functional in E. coli and 
a marker Which allows for selection of the tlanHITmed 
cells. Illustrative vectors include pBR332, pac ser- 
ies. Ml 3mp series, pAcyeie4, etc. thus, the sequence 
«ay be inserted into the vector at an appropriate re- 
form th!".,':' ' P^""" ""^ " '"ns- 
form the E. coll host, the E. coU grown in an appro- 
priate nutrient medium and the cells harvested and 
lysed and the plasmid recovered. Analysts may involve 
-e^enc. analysis, restriction analysis, electrophore- 
sis, or the like. After each manipulation the DKA se- 
quence to be used m the final construct may be re- 

the partial constucts may be cloned in the same or 
different plasmids. 
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In addition to the transcription construct, 
depending upon the manner of introduction of the trans- 
cription construct into the plant, other DNA sequences 
may be required. For example, when using the Ti- or 
5 Ri-plasmid for transformation of plant cells, as de- 
scribed below, at least the right border and frequently 
both the right and left borders of the T-DNA of the Ti- 
er Ri-plasmids will be joined as flanking regions to 
the transcription construct. The use of T-DNA for 

10 transformation of plant cells has received extensive 

study and is amply described in EPA Serial No. 120,516, 
Hoekema, In: The Binary Plant Vector System Offset- 
drukkerij Ranters B.V., Alblasserdam, 1985, Chapter V, 
Knauf et al . , Genetic Analysis of Host Range Expression 

15 by Aqrobacterium , In:Molecular Genetics of the Bacteria- 
Plant Interaction, Puhler, A. ed. ,Springer-Verlag, NY, 
. 1983, p.245r and An et al., EMBO J. (1985) 4:277-284 

Alternatively, to enhance integration into the 
plant genome, terminal repeats of transposons may be 

20 used as borders in conjunction with a transposase. In 
this situation, expression of the transposase should be 
inducible, so that once the transcription construct is 
integrated into the genome, it should be relatively 
stably integrated and avoid hopping. 

25 The transcription construct will normally be 

joined to a marker for selection in plant cells. Con- 
veniently, the marker may be resistance to a biocide, 
particularly an antibiotic, such as kanamycin, G418, 
bleomycin, hygromycin, chloramphenicol, or the like. 

30 The particular marker employed will be one which will 
allow for selection of transformed cells as compared to 
cells lacking the DNA which has been introduced. 

A variety of techniques are available for the 
introduction of DNA into a plant cell host. These 

35 techniques include transformation with Ti-DNA employing 
A. tumefaciens or A, rhizoqenes as the transforming 
agent, protoplast fusion. Injection, electroporation, 
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etc. For transformation with Agrobacterium , plasmids 
can be prepared in E. coli which plasmids contain DNA 
homologous with the Ti-plasmid, particularly T-DNA. 
The plasraid may or may not be capable of replication in 
5 Aqrobacterium . that is, it may or may not have a broad 
spectrum prokaryotic replication system, e.g., RK290, 
depending in part upon whether the transcription con- 
struct is to be integrated into the Ti-plasmid or be 
retained on an independent jplasmid. By means of a 

10 helper plasmid, the transcription construct may be 
transferred to the A. tumefaciens and the resulting 
transformed organism used for transforming plant cells. 

Conveniently, explants may be cultivated with 
the A. tumefaciens or A. rhizoqenes to allow for trans- 

15 fer of the transcription construct to the plant cells, 
the plant cells dispersed in an appropriate selective 
medium for selection, grown to callus, shoots grown and 
plantlets regenerated from the callus by growing in 
rooting medium. The Aqrobacterium host will contain a 

20 plasraid having the vir genes necessary for transfer of 
the T-DNA to the plant cells and may or may not have 
T-DNA. For injection and electroporation, disarmed 
Ti-plasmids (lacking the tumor genes, particularly the 
T-DNA region) may be introduced into the plant cell. 

25 As a host cell, any of a number of fruit bear- 

ing plants may be employed in which the plant parts of 
interest are derived from the ovary wall. These in- 
clude true berries such as tomato, grape, blueberry, 
cranberry, currant, and eggplant; stone fruits (drupes) 

30 such as cherry, plum, apricot, peach, nectarine and 
avocado; compound fruits (druplets) such as raspberry 
and blackberry. In hesperidium (oranges, citrus), the 
expression cassette might be expected to be expressed 
in the "juicy" portion of the fruit. In pepos (such as 

35 watermelon, cantelope, honeydew, cucumber and squash) 
the equivalent tissue for expression is most likely the 
inner edible portions, whereas in legumes (such as 
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peasr green beans / soybeans) the equivalent tissue is 
the seed pod. 

Identifying useful transcriptional initiation 
regions may be achieved in a number of ways. Where a 
fruit protein has been or is isolated r it may be par- 
tially sequenced f so that a probe may be designed for 
identifying messenger RNA specific for fruit. To fur- 
ther enhance the concentration of the messenger RNA 
specifically associated with fruit, cDNA may be pre- 
pared and the cDNA subtracted with messenger RNA or 
cDNA from non-fruit associated cells. The residual 
cDNA may then be used for probing the genome for com- 
plementary sequences, using an appropriate library 
prepared from plant cells. Sequences which hybridize 
to the cDNA may then be isolated, manipulated, and the 
S '-untranslated region associated with the coding re- 
gion isolated and used in espression constructs to 
identify the transcriptional activity of the 5* -un- 
translated region. In some instances, a probe may be 
employed directly for screening a genomic library and 
identifying sequences which hybridize to the probe. 
The sequences will be manipulated as described above to 
identify the 5 * -untranslated region. 

As an example, a promoter of particular inter- 
est for the subject invention, the fruit-specific tran- 
scriptional initiation region (promoter) from a ONA 
sequence which encodes a protein described as 2A11 in 
the Experimental section was identified as follows. 
cDNA clones made from ripe fruit were screened using 
cONA probes made from ripe fruit, green fruit, and leaf 
mRNA. Clones were selected having more intense hybrid- 
ization with the fruit DMAs as contrasted with the leaf 
cDNAs. The screening was repeated to identify a par- 
ticular cDNA referred to as 2A11. The 2A11 cDNA was 
then used for screening RNA from root, stem, leaf, and 
seven stages of fruit development after the mRNA was 
sized on gels. The screening demonstrated that the 
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particular message was present throughout the seven 
stages of fruit development. The mRNA complementary to 
the specific cDNA was absent in other tissues which 
were tested. The cDNA was then used for screening a 
5 genomic library and a fragment selected which hybrid- 
ized to the subject cDNA. The 5' and 3' non-coding re- 
gions were isolated and manipulated for insertion of a 
foreign sequence to be transcribed under the regulation 
of the 2A11 promoter. 

1*0; The cells which have been transformed may be 

grown into plants in accordance with conventional ways. 
See, for example, McCormick et al. , Plant Cell Reports 
(1986) ^:81-84. These plants may then be grown, and 
either pollinated with the same transformed strain or 

15 different strains, identifying the resulting hybrid 
having the desired phenotypic characteristic. Two or 
more generations may be grown to ensure that the sub- 
ject phenotypic characteristic is stably maintained and 
inherited and then seeds harvested for use to provide 

20 fruits with the new phenotypic property. 

A protein is provided having the sequence de- 
scribed in the Experimental section designated as.2All. 
This protein could be a storage protein and be useful 
in enhancing sulfur containing amino acids (cysteine 

25 and methionine) in the diet. It can be obtained in 

substantially pure form by providing for expression in 
prokaryotes or eukaryotes, e.g., yeast by inserting the 
open reading frame into an expression cassette contain- 
ing a transcriptional initiation region. A variety of 

30 expression cassettes are commercially available or have 
been described in the literature. See, for example, 
U.S. Patent Nos. 4,532,207; 4,546,082; 4,551,433; and 
4,559,302. The product, if intracellular, may be iso- 
lated by lysing of the cells and purification of the 

35 protein using electrophoresis, affinity chromatography, 
HPLC extraction, or the like. The product may be iso- 
lated in substantially pure form free of other plant 
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productSr generally having at least about 95% purity, 
usually at least about 99% purity. 

The following examples are offered by way of 
illustration and not by limitation. 

5 

EXPERIMENTAL 



Example 1 

Construction of Tomato Ripe Fruit cDNA Bank 
10 and Screening for Fruit-Specif ic Clones 

Tomato plants ( Lycopersicon esculentum cv 
UC82B) were grown under greenhouse conditions. Poly{A)'^RNA 
was isolated as described by Mansson et al . , Mol* Gen« 
Genet. (1985) 200:356-361. The synthesis of cDNA from 
15 polyCA)"** RNA prepared from ripe fruit, cloning into the 
PstI site of the plasmid pUC9 and transformation into 
an E. coll vector were all as described in Mansson et 
al.r Mol. Gen. Genet. (1985) 200:356-361. 

20 Library Screening 

Two thousand recombinant clones were screened 
by colony hybridization with radiolabeled cDNA made 
from tomato red fruit mRNA, immature green fruit mRNA, 
and leaf mRNA. Bacterial colonies immobilized onto 

25 GeneScreen Plus filters (New England Nuclear), were de- 
natured in 1.5 M NaCl in O.S M NaOH, then neutralized 
in 1.5 M NaCl in 0.5 M Tris-HCl pH 8, and allowed to 
air dry. Hybridization, washing and autoradiography 
were all performed as described in Maniatis et al. , 

30 Molecular Cloning: A Laboratory Manual (1982) Cold 
Spring Harbor, New York. 

Sixty-five clones were selected which had more 
intense hybridization signals with fruit cONA than with 
leaf cONA and therefore appeared to be under-represent- 

35 ed in the leaf mRNA population relative to the fruit 
population. Replicate slot blot filters were prepared 
using purified DNA from the selected clones and hybrid- 
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ized with radioactive cONA £rom leaf, green fruit, and 
red fruit as before. This allowed selection of cDNA 
clone 2A11, also referred to as pCGN1299 which is on at 
high levels in both the fruit stages (red and green) 
5: and off in the leaf. 

Example 2 
Analysis of Clones 
Synthesis of RNA Probes 

lO; The cDNA insert of pCGN1299 was excised as an 

Eco RI to Hindlll fragment of approximately 600 bp (as 
measured on an agarose gel), and subcloned into the 
Riboprobe vector pGEMl (Promega Biotec) , creating 
pCGN488. ^^P-labeled transcripts made from each strand 

15 of the pCGN488 insert using either SP6 or T7 polymerase 
were used as probes in separate Northern blots contain- 
ing mRNA from leaf, immature green and mature red 
fruits. The RNA transcript from the SP6 promoted did 
not hybridize to the tomato mRNA. However, the tran- 

20 script from the T7 promoter hybridized to an mRNA of 

approximately 700 nt in length from the green fruit and 
the red fruit but not to mRNA from tomato leaf. The 
direction of transcription of the corresponding mRNA 
was thus determined. 

25 The tissue specificity of the pCGN1299 cDNA 

was demonstrated as follows. RNA from root, stem, 
leaf, and seven stages of fruit development (immature 
green, mature green, breaker, turning, pink, light red, 
and red) was sized on formaldehyde/agarose gels accord- 

30 ing to the method described by Maniatis et al., (1982), 
immobilized on nitrocellulose and hybridized to ^^P- 
labeled RNA which was synthesized in vitro from pCGN488 
using T7 polymerase. Each lane contained 100 ng of 
polyA* RNA except for two lanes (pink and light red 

35 lanes) which contained 10 ug of total RNA. The 

Northern analysis of mRNA from root, stem, leaf, and 
various stages of fruit development indicated that 
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pCGN1299 cONA was expressed in all stages of £ruit 
development from the early stages immediately after 
anthesis to red ripe fruit. No mRNA hybridizing to 
pCGN1299 was found in leaf, stem, or root tissue. The 
5 size of the mRNA species hybridizing to the pCGN488 
probe was approximately 700 nt. 

Message abundance corresponding to the 
pCGN1299 cDNA was determined by comparing the hybrid- 
ization intensity of a known amount of RNA synthesized 

10 in vitro from pCGN488 using SP6 polymerase to mRNA from 
red tomato fruit in a Northern blot. The -^^P-labeled 
transcript from pCGN488 synthesized in vitro using T7 
polymerase was used as a probe. The Northern analysis 
was compared to standards which indicated that the 

15 pCGN1299 cDNA represents an abundant mRNA class in 
tomato fruit r being approximately 1% of the message. 

Example 3 
Sequencing of pCGN1299 and 
20 PCGN1298 cDNA Clones 

DNA Sequencing 

The polyA^ sequence was missing from pCGN1299 
cONA. A longer cDNA clone, pCGN1298r therefore was 
identified by its hybridization with the pCGN488 probe. 
25 The complete DNA sequence of the two cDNA inserts was 
determined using both Maxam-Gilbert and the Sanger di- 
deoxy techniques and is as follows. The sequence of 
pCGN1298 contains additional sequences at both the 5* 
and 3* end compared to pCGN1299. As shown in Figure 1, 
30 the sequences are identical over the region that the 
two clones have in common. 

Amino Acid Sequence 

The pCGN1299 cONA sequence was translated in 
35 three frames. The longest open reading frame (which 

starts from the first ATG) is indicated. Both pCGN1299 
and pCGN1298 have an open reading frame which encodes a 
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96 amino acid polypeptide (see Figure. 1). The protein 
has a hydrophobic N-terminus which may indicate a 
leader peptide for protein targeting. A hydrophobicity 
profile was calculated using the Hopp and Woods r (?roc. 
5 Natl. Acad. Sci, OSA (1981) 78:3824-3828) algorithm. 
Residues 10-23 have an extremely hydrophobic region. A 
comparison of 2A11 to pea storage proteins and other 
abundant storage proteins is shown in Figure 2. The 
sulfur-rich composite of the fruit-specific protein is 
10 similar to a pea storage protein which has recently 
been described (see Higgins et al . , J. Biol. Chem. 
(1986) 261:11124-11130, for references to the individ- 
ual peptides). This may indicate a storage role for 
this fruit-specific protein abundant species. 

15 

Example 4 
Screening Genomic Library 
for Genomic Clones 
Southern Hybridization 
20 Southern analysis was performed as described 

by Maniatis et al., 1982. Total tomato DNA from cult- 
ivar UC82B was digested with Eco RI or Hin dlll, sepa- 
rated by agarose gel electrophoresis and transferred to 
nitrocellulose. Southern hybridization was performed 
25 using a ^^P-labeled probe produced by nick translation 
of PCGN488 (Maniatis et al., 1982). The simple 
hybridization pattern indicated that the gene encoding 
pCGN1299 cDNA was present in a few or perhaps even one 
copy in a tomato genome. 

30 

Isolation of a Genomic Clone 

A genomic library established in Charon35/ 
Sau 3A constructed from DNA of the tomato cultivar VFNT- 
Cherry was screened using the (^^P]-RNA from cDNA clone 
35 PCGN488 as a probe. A genomic clone containing approx- 
imately 12.5 kb of sequence from the tomato genome was 
isolated. The region which hybridizes to a pCGN488 
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probe spans an Xba l restriction sice which was found in 
the cDNA sequence and includes the transcriptional 
initiation region designated 2A11. 

5 Sequence of Genomic Clone 

The DNA sequence of the genomic clone was 
determined by Sanger dideoxy techniques and is as shown 
in Figure 4. The sequence of the genomic clone is 
identical to the pCGN1299 cDNA clone over the region 
10 they have in common. 

Subcloninq 

The region surrounding the Xba l restriction 
site, approximately 2.4 kb in the 5' direction and ap- 

15 proximately 2.1 kb in the 3' direction was subcloned to 
provide an expression cassette. The 5* Xho l to Xba l 
fragment and the 3' Xba l to Eco RI fragment from the 
2A11 genomic clone were inserted into a pUC-derived 
chloromphenicol plasmid containing a unique Xho l site 

20 and no Xba l site. This promoter cassette plasmid is 
called PCGN1273. 

Example 5 
Construction of Fruit- 
25 Specific Antisense Cassette 

Insertion of Antisense Fraqmant 

The 2A11 genomic fragment was tagged with PG 
antisense sequences by insertion of PG into the unique 
Xba l site of the pCGN1273 promoter cassette in the anti- 
30 sense orientation. The inserted sequences increased 
the size of the mRNA over the endogenous transcript r 
and thus the expression pattern of the construct could 
be compared to the endogenous gene by a single Northern 
hybridization in a manner analogous to the detection of 
35 a tuber-specific potato gene described by Eckes et al . / 
Mol. Gen. Genets 1986 205:14-22. 
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Example 6 

Insertion of Tagged Genomic Construction 
Into Agrobacterium Binary Vectors 
The tagged genomic construction is excised 
5 using the flanking Xho l restriction enzyme sites and is 
cloned into the unique Sai l site of the binary plasmid 
pCGN783 containing a plant kanamycin resistance marker 
between the left and right borders to provide plasmid 
PCGN1269. 

10. This plasmid binary vector in E. coli C2110 is 

conjugated into A. tumefaciens containing a disarmed 
Ti-plasraid capable of transferring the polygalacturo- 
nase antisense cassette and the kanamycin resistance 
cassette into the plant host genome. 

15 The Agrobacterium system which is employed is 

A. tumefaciens PC2760 (G. Ooms et al., Plasmid (1982) 
7:15-29; Hoekema et al • , Nature (1983) 303:179-181; 
European Patent Application 84-200239.6, 2424183). 

20 !• Construction of pCGN783 

pCGN783 is a binary plasmid containing the 
left and right T-DNA borders of A. tumefaciens octopine 
Ti-plasmid pTiAS (Currier and Nester, J. Bacterid. 
(1976) 126 :157-165) the gent2unicin resistance gene of 

25 pPHlJl (Hirsch et al., Plasmid (1984) 12:139-141), the 
35S promoter of cauliflower mosaic virus (CaMV) 
(Gardner et al.. Nucleic Acid Res. (1981) 9:1871-1880); 
the kanamycin resistance gene of Tn5 (Jorgensen, Mol. 
Gen. (1979) 177 :65) ; and the 3* region from transcript 

30 7 of pTiA6 (Currier and Nester, supra (1976)). A sche- 
matic diagram of the construction of pCGN783 is shown 
in Figure 3. (a) through (f) refer to the plasmid 
constructions detailed below, 

35 
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(a) Construction of pCGN58 7 

The Hindlll-Smal fragment of Tn5 containing 
the entire structural gene for APHS'II (Jorgensen et 
al.r Hoi. Gen> (1979) 177:65), was cloned into pOC8 

5 (Vieira and Messing, Gene (1982) 19:259), converting 
the fragment into a Hindlll-EcoRI fragment, since there 
is an Eco RI site immediately adjacent to the Sma l site. 
The PstI -EcoRI fragment containing the 3* portion of 
the AFHB'II gene was then combined with an EcoRI-BamHI* 

0 Sall-PstI' linker into the EcoRI site of pUC7 (pCGN546W). 
Since this construct does not confer kanamycin resis- 
tance, kanamycin resistance was obtained by inserting 
the Bgll-PstI fragment of the APH3*II gene into the 
BamH I- Pst I site (pCGN546X). This procedure reassembles 

5 the APH3'II gene, so that Eco RI sites flank the gene. 
An ATG codon was upstream from and out of reading frame 
with the ATG initiation codon of APH3*II. The undesi- 
red ATG was avoided by inserting a Sau3A-PstI fragment 
from the 5* end of AFH3*II, which fragment lacks the 

0 superfluous ATG, into the BamH I- Pst I site of pCGN546W 
to provide plasmid pCGN5S0. The EcoRI fragment of 
pCGN550 containing the APH3*II gene was then cloned 
into the EcoR I site of pOC8-pUC13 (K. Buckley supra 
(1985)) to give pCGNSSl. 

5 Each of the EcoRI fragments containing the 

AFH3'II gene was then cloned into the unique Eco RI site 
of pCGN451, which contains an octopine synthase cas- 
sette for expression to provide pCGN548 (2ATG)) and 
tCGNSS2 (lATG). The plasmid pCGN451 having the ocs 5' 

0 and the ocs 3* in the proper orientation was digested 
with Eco RI and the Eco RI fragment from pCGNSSl con- 
taining the intact kanamycin resistance gene inserted 
with EcoRI site to provide pCGN552 having the kanamycin 
resistance gene in the proper orientation. This ocs/KAN 

5 gene was used to provide a selectable marker for the 
trans type binary vector pCGN587. 
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The 5' portion of the engineered octopine syn- 
thase promoter cassette consists of pTiA6 DNA from the 
JChoI at bp 15208-13644 (Barker et al. , supra {1983)), 
which also contains the T-DNA boundary sequence (border) 
5 implicated in T-DNA transfer. In the plasmid pCGN587, 
the osc/KAN gene from pCGN552 provides a selectable 
marker as well as the right border. The left boundary 
region was first cloned in M13mp9 as a Hindlll-Smal 
piece (pCGN502) (base pairs 602-2212) and recloned as a 

10 Kpn I-EcoRI fragment in pCGN565 to provide pCGN580. 

PCGN565 is a cloning vector based on pUC8-Cm, but con- 
taining pUCia linkers. pCGNSBO was linearized with 
BamHI and used to replace the smaller Bql l fragment of 
pVCK102 (Knauf and Nester, Plasmid (1982) 8:45), creat- 

15 ing PCGN585. By replacing the smaller Sai l fragment of 
PCGN585 with the Xhol fragment from pCGN552 containing 
the ocs/KAN gene, pCGN587 was obtained. 

(b) Construction of pCGN739 (Binary Vector) 

20 To obtain the gentamicin resistance marker, 

the resistance gene was isolated from a 3.1 kb Eco RI- 
Pstl fragment of pPHIJI (Hirsch et al. , Plasmid (1984) 
12:139-141) and cloned into pOC9 (Vieira et al.. Gene 
(1982) 19:259-268) yielding pCGN549. 

25 The PCGN549 Hin dlll-BamHI fragment containing 

the gentamicin resistance gene replaced the Hin dlll- 
Bql ll fragment of pCGN587 (for construction, see 
6.1(a), supra ) creating pCGN594. 

The PCGN594 Hindlll-BamHI region which con- 

30 tains an ocs-kanamycin-ocs fragment was replaced with 

the Hindi I I -BamHI polylinker region from pUC18 (Yanisch- 
Perron, Gene (1985) 33:103-119) to make pCGN739. 

(c) Construction of 726c (1 ATG-Kanamycin-3 ' region) 
35 PCGN566 contains the EcoRI-Hindlll linker of 

. pOC18 (Yanisch-Perron, ibid) inserted into the EcoRI- 
Hindlll sites of pOC13-Cm (K. Buckley, Ph.D. Thesis, 



wo 88/09334 



PCT/LiS88/01811 



University of California, San Diego, 1985). The Hindlll- 
B3III fragment of pNW31c-8, 29-1 (Thomashow et al . , Cell 
(1980) 19:729) containing ORFl and 2 (Barker et al. , 
Plant Mol. Biol. (1984) 2:335-350) was subcloned into 
5 the Hindlll-BamHI sites of pCGN566 producing pCGN703. 

The Sau3A fragment of pCGN703 containing the 
3' region of transcript 7 from pTiA6 (corresponding to 
bases 2396-2920 of pTil5955 (Barker et al. , supra (1984)) 
was subcloned into the BamHI site of pUC18 (Yanisch- 

10 Perron et al. , supra (1985)) producing pCGN709. 

The EcoAI-Smal polylinker region of pCGN709 
was replaced with the Eco RI- Sma l fragment from pCGN587 
(see 6.1(a), supra ) which contains the kanamycin resis- 
tance gene (APH3'II) producing pCGN726. 

15 The Eco RI- Sal l fragment of pCGN726 plus the 

Bqlll-Sall sites of pOC8-pUC13-cm (chloramphenical 
resistant, K. Buckley, Ph.D. Thesis, University of 
California, San Diego, 1985). producing pCGN738. To 
construct pCGN734, the Hindlll-SphI site of M13mpl9 

20 (Norrander et al . , Gene (1983) 2_6:101-106) • Using an 
oligonucleotide corresponding to bases 3287 to 3300, 
DNA synthesis was primed from this template. Following 
SI nuclease treatment and Hindlll digestion, the resul- 
ting fragment was cloned into the Hindlll-Smal site of 

25 P0C19 (Yanisch-Perron et al., supra (1985)). The re- 
sulting Eco RI to Hin dlll fragment of pTiA6 (correspon- 
ding to bases 3390-4494) into the EcoRI site of pUC8 
(Vieira and Messing, supra (1982)) resulting in pCGN734. 
pCGN726c is derived from pCGN738 by deletir.9 the 900 bp 

30 Eeo RI- Eco RI fragment. 

(d) Construction of PCGN167 

pCGN167 is a construct containing a full length 
CaMV promoter, 1 ATG-kanamycin gene, 3' end and the 
35 bacterial Tn903-type kanamycin gene. MI is an Eco RI 

fragment from pCGNSSO (see construction of pCGN587) and 
was cloned into the Eco RI cloning site in the 1 ATG- 
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kanamycin gene proximal to the polylinker region of 
M13mp9. See copending Application Serial No. 920,579, 
filed October 17, 1986, which disclosure is incorpora- 
ted herein by reference. 
5:3 To construct pCGN167, the Alu l fragment of 

CaMV (bp 7144-7735) (Gardner et al., Nucl. Acids Res. 

(1981) 9:2871-2888) was obtained by digestion with Alu l 
and cloned into the Hindi site of M13mp7 (Vieira, Gene 

(1982) 19:259) to create C614. An EcoRI digest of C614 
10 produced the EcoRI fragment from C614 containing the 

35S promoter which was cloned into the Eco RI site of 
pUC8 (Vieira et al.. Gene (1982) 19:259) to produce 
PCGN146. To trim the promoter region, the Bql ll site 
(bp 7670) was treated with Bqlll and Bal31 and subse- 

15 quently a Bqlll linker wis attached to the Bal31 trea- 
ted DNA to produce pCGN147. 

pCGN148a containing the promoter region, 
selectable marker (RAN with 2 ATGs) and 3' region was 
prepared by digesting pCGN528 (see below) with Bglll 

20 and inserting the BamHI-Bglll promoter fragment from 
PCGN147. This fragment was cloned into the Bglll site 
of PCGN528 so that the Bglll site was proximal to the 
kanamycin gene of pCGN528. 

The shuttle vector used for this construct, 

25 PCGN528, was made as follows. pCGN525 was made by 
digesting a plasmid containing Tn5 which harbors a 
kanamycin gene (Jorgenson et al., Mol. Gen. (1979) 
122:65) with Hindlll-BamHI and inserting the Hindlll- 
BamBI fragment containing the kanamycin gene into the 

30 Hindlll-BamHI sites in the tetracycline gene of 

PACYC184 (Chang and Cohen, J. Bacterid. (1978) 134: 
1141-1156). pCGN526 was made by inserting the BamHI 
fragment 19 of pTiA6 (Thomashow et al.. Cell (1980) 
19:729-739) into the BamHI site of pCGN525. pCGN528 

35 was obtained by deleting the small Xho l fragment from 
PCGN526 by digesting with Xhol and religating. 
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pCGN149a was made by cloning the BamH I kana- 
mycin gene fragment from pMB9KanXXI into the Bam HI site 
of pCGN148a, pMB9KanXXI is a pOC4K variant (Vieira and 
Messing, Gene (1982) 19:259-268) which has the Xho l site 
5 missing but contains a functional kanamycin gene from 
Tn903 to allow for efficient selection in Aqrobacter ium > 

pCGN149a was digested with Bqlll and Sghl. • 
This small Bql ll- Sph I fragment of pCGN149a was replaced 
with the Bam HI- Sph I fragment from MI (see below) iso- 
10 lated by digestion with Bam HI and Sph I, This produces 
PCGN167. 



(e) Construction of pCGN766c {35S prcmoter-3' region) 
The Hin dlll- BamH I fragment of pCGN167 contain- 

15 ing the CaMV-35S promoter, 1 ATG-kanamycin gene and the 
BamH I fragment 19 of pTiA6 was cloned into the BamH I- 
Hindlll sites of pOC19 (Norrander et al. , supra (1985); 
Yanisch-Perron et al., supra (1985)) creating pCGN976. 

The 35S promoter and 3' region from transcript 

20 7 was developed by inserting a 0.7 kb Hindlll-EcoRI 
fragment of pCGN976 (3SS promoter) and the 0.5 kb 
EcoRI-Sall fragment of pCGN709 (transcript 7:3' for 
construction see supra ) into the Hindlll-Sall sites of 
PCGN566 creating pCGN766c« 

25 

(f ) Final Construction of pCGN783 

The 0.7 kb Hindlll-EcoRI fragment of pCGN766c 
(CaMV-35S promoter) was ligated to the 1.5 kb EcoRI- 
Sal l fragment of pCGN726c ( l-ATG-KAN-3 • region) into 

30 the Hindlll-Sall sites of pUC119 (J. Vieira, Rutgers 

University, New Jersey) to produce pCGN778. The 2.2 kb 
region of pCGN778, Hindlll-Sall fragment containing the 
CaMV 35S promoter ( l-ATG-KAN-3 ' region) replaced the 
Hin dlll-Sall polylinker region of pCGN739 to produce 

35 PCGN783. 
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Example 7 
Transfer of Genomic Construction 
to Tomato via Cocultivation 
Substantially sterile tomato cotyledon tissue 
5 is obtained from seedlings which have been grown at 24*^0/ 
with a 16hr/8hr day/night cycle in 100x25 mm petri dishes 
containing Murashige-Skoog salt medium and 0.8% agar 
(pH 6.0). Any tomato species may be used^ however, 
here the inbred breeding line was OC82B, available from 

10 the Department of Vegetable Crops, University of Cali- 
fornia, Davis r CA 95616. The cotyledons are cut into 
three sections and the middle placed onto feeder plates 
for a 24-hour preincubation. The feeder plates are pre- 
pared by pipetting 0.5 ml of a tobacco suspension cul- 

15 ture (10^ cells/ml) onto 0.8% agar medium, containing 
Murashige minimal organic medium (K.C. Biologicals) , 
2,4-D (0.1 mg/1), kinetin (1 mg/1), thiamine (0.9 mg/1) 
and potassium acid phosphate (200 mg/1, pH 5.5). The 
feeder plates are prepared two days prior to use. A 

20 sterile 3 mm filter paper disk containing feeder medium 
is placed on top of the tobacco cells after the suspen- 
sion cells are grown for two days.' 

Following the preincubation period, the middle 
one third of the cotyledon sections are placed into a 

25 liquid MG/L broth culture (1-5 ml) of the A. tumefaci- 
ens strain. The binary plasmid pCGN1269 is transferred 
to A. tumefaciens strain 2760 by conjugation or by 
transformation selecting for Gentamicin resistance en- 
coded by the plasmid pCGN1269. The cotyledon sections 

30 are cocultivated with the bacteria for 48 hrs on the 
feeder plates and then transferred to regeneration 
medium containing 500 mg/1 carbenicillin and 100 mg/1 
kanamycin. The regeneration medium is a K.C. Biologi- 
cals Murashige-Skoog salts medium with zeatin (2 mg/1) 

35 myo-inositol (100 mg/1), sucrose (20 g/1), Nitsch vita- 
mins and containing 0.8% agar (pH 6.0). In 2-3 weeks, 
shoots are observed to develop. When the shoots are 
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approximately 1.25 cm, they are excised and transferred 
to a Murashige and Skoog medium containing carbenicil- 
lin (500 mg/1) and kanamycin (50 mg/1) for rooting. 
Roots develop within 10-12 days. 
5 Shoots which develop and subsequently root on 

media containing the kanamycin are tested for APH3*II 
enzyme. 

An aminoglycoside phosphotransferase enzyme 
(APH3*II) assay is conducted on putative transformed 

10 tomato plants and shoots. APHB'II confers resistance 

to kanamycin and neomycin. APH3*II activity is assayed 
(Reiss et al.. Gene (1984) 30(2211-218) employing elec- 
trophoretic separation of the enzyme from other inter- 
fering proteins and detection of its enzymatic activity 

15 by in situ phosphorylation of kanamycin. Both kanamy- 
cin and Iy^^^P) ATP act as substrates and are embedded 
in an agarose gel which is placed on top of the poly- 
acrylamide gel containing the proteins. After the 
enzymatic reaction, the phosphorylated kanamycin is 

20 transferred to P-81 phosphocellulose ion exchange paper 
and the radiolabeled kanamycin is finally visualized by 
autoradiography. The Reiss et al. method is modified 
In the final washing of the P-81 ion exchange paper by 
rinsing in 0.1 mg/ml of proteinase R. 

25 

Example 8 

Construction of Tagged 2A11 Plasmids 

In Binary Vectors 
The complete sequence of the 2Alx genomic DNA 
30 cloned into pCGN1273 from the Xhol site (position 1 at 
the 5' end) to the Eco RI site (position 4654) is shown 
in Figure 4. 

pCGM1267 was constructed by deleting from 
pCGN1273 a portion of the plasmid polylinker from the 
35 Eco RV site to the Bam HI site. Two DNA sequences were 
inserted into pCGN1273 at the unique Xbal site (posi- 
tion 2494). This site is in the 3* non-coding region 
of the 2A11 genomic clone before the poly A site. 
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pCGN1273 was tagged with 360 bp (from base 
number 1 to 360) from the 5' region of the tomato poly- 
galacturonase (PG) cDNA clone, Fl (Sheehy et al. , Mol. 
Gen. Genet. (1987) 208:30-36) at the unique Xba l re- 
5 striction enzyme site. The tag was inserted in the 

antisense orientation resulting in plasmid pCGN1271 and 
in the sense orientation yielding plasmid pCGN1270. 
Each plasmid was linearized at the unique Bqlll re- 
striction enzyme site and cloned into the binary vector 
10 pCGN783 at the unique BamH I restriction enzyme site. 

pCGN1273 was also tagged with a 0.5 kb frag- 
ment of DNA (base number 1626 to 2115) from a PG geno- 
mic clone (see Figure 5) which spans the 5* end of the 
intron/exon junction. This fragment was cloned into 
15 the Xba l site resulting in plasmid pCGN1215. pCGN1215 
was linearized at. the unique Bql ll site and cloned into 
PCGN783 at the BamHI site resulting in two plasmids, 
PCGN1219 and pCGN1220, which differ only in the orien- 
tation of pCGN121S within pCGN783. 
20 Three DNA sequences were inserted into pCGN1267 

at the unique Cla l sites (position 2402, 2406). These 
sites are in the 3' non-coding region of the 2A11 geno- 
mic clone, 21 bp from the stop codon. The 383 bp Xba l 
fragment from the PG cDNA clone was cloned into the 
25 Cla l site of pCGN1267 after filling in the Xba l and 

Cla l ends with Rlenow and blunt ligation. The fragment 
in a sense orientation resulted in plasmid pCGN1263 and 
in the antisense orientation gave pCGM1262. pCGN1263 
was linearized at the unique BqlH site und cloned into 
30 PCGN783 at the BamHI site yielding pCGN1260. pCGN1262 
was also linearized at the Bqlll site and cloned into 
PCGN783 at the BamHI site resulting in two plasmids, 
pCGN12S5 and pCGN1258, which differ only in the orien- 
tation of pCGNl262 in the binary vector pCGN783. 
35 The 0.5 kb fragment of the PG genomic clone 

spanning the intron/exon junction ( supra ) was cloned 
Into PCGN1267 at the Clal site in an antisense direc- 
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tion yielding plasmid pCGN1225. This plasmid was 
linearized at the Bql ll restriction enzyme site and 
cloned into pCGN783 at the BamHI site producing two 
plasmids, pCGN1227 and pCGN122ar which differ only in 
5 the orientation of pCGN1225 in the binary vector. 

The Eco7 fragment (base numbers 5545 to 
12,823) (Barker et al.. Plant Mol. Biol. (1983) 2:335- 
350) from the octopine plasmid pTiA6 of A. tumefaciens 
(Knauf and Nester, Plasmid (1982) 8:45-54) was sub- 

10 cloned into pOC19 at the EcoRI site resulting in plas- 
mid pCGN71. A Rsal digest allowed a fragment of DNA 
from bases 8487 to 9036 of the Eco 7 fragment to be sub- 
cloned into the vector ml3 BlueScript Minus (Strata- 
gene, Inc.) at the Sma l site resulting in plasmid 

15 PCGN1278. This fragment contains the coding region 
of the genetic locus designated tmr which encodes a 
dimethylallyl transferase (isopentenyl transferase) 
(Aklyoshi et al . , Proc. Natl. Acad. Sci. USA (1984) 
81:5994-5998; Barry et al. , ibid (1984) 81:4776-4780). 

20 An exonuclease/mung bean treatment (Promega Biotech) 
produced a deletion on the 5* end of the tmr gene to a 
point 39 base pairs 5* of the start codon. The tmr 
gene from pCGN1272 was subcloned into the Clal site of 
PCGN1267. The tmr gene in the sense orientation 

25 yielded pCGN1261 and in the antisense orientation gave 
plasmid PCGN1266. pCGN1261 was linearized at the Bglll 
site and cloned into pCGN783 at the BamHI site result- 
ing in plasmid pCGN1254. pCGN1266 was also linearized 
at the Bqlll site and subcloned into pCGN783 at the 

30 BamHI site yielding two plasmids, pCGN1264 and pCGN1265, 
which differ only in the orientation of pCGN1266 in 
PCGN783. 



Analysis of Expression in Transgenic Plants 
35 Immature green fruit (approximately 3.2 cm in 

length) was harvested from two tomato plants cv. UC82B 
that had been transformed with a disarmed Agrobacterium 



28 

Strain containing pCGN1264. Transgenic plants are 
designated 1264-1 and 1264-11. The pericarp from two 
fruits of each plant was ground to a powder under liq- 
uid Nj, total RNA extracted and polyA* mRNA isolated 
(as described in Mansson et al., Mol» Gen. Genet. 
(1985) 20£- 356-361) . Young green leaves were also 
harvested from each plant and polyA* mRNA isolated. 

Approximately 19 vg of total RNA from fruit, 
70 ng of polyA* mRNA from fruit and 70 ng of polyA* 
mRNA from leaves from transformed plants 1264-1 and 
1264-11 was run on a 0.7% agarose formaldehyde Northern 
gel and blotted onto nitrocellulose (Maniatis et al.. 
Molecular Cloning: A Laboratory Manual (1982) Cold 
jSpring Harbor, New York). Also included on the gel as 
a negative control was approximately 50 ng of polyA* 
mRNA from leaf and immature green fruit of a nontrans- 
formed OC82B plant. 

As a positive control and to help in quanti- 
tating mRNA levels, in vitro transcribed RNA from 
PCGN1272 was synthesized using T3 polymerase (Strata- 
gene, Inc.). Nineteen pg and 1.9 pg of this in vitro 
synthesized RNA were loaded on the Northern gel. 

The probe for the Northern filter was the 
1.0 kb tmr insert DNA (a Kpn l to Sad fragment) from 
pCGN1272 isolated by electroelution from an agarose gel 
(Maniatis, supra (1982)) and labeled by nick transla- 
tion (Bethesda Research Laboratory kit) using a^^p aCTP 
(Amersham) . 

The Northern filter was prehybridized at 42*C 
for 5 hrs in the following solution: 25 ml formaraide, 
12.5 ml 20X SSC, 2.5 ml 1 M NaP, 5 ml SOX Oenhardts, 
0.5 ml 10% SDS, 1 ml 250 mM EDTA, 1 ml 10 mg/ml ssONA 
and 2 ml H2O. Then one-fifth volume of 50% dextran 
sulfate and approximately 2.2X 10^ cpm of the probe was. 
added and hybridization was for 15 hrs at 42**C. 
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The Northern filter was washed one time in 2X 
SSC and 0«1%'SDS at 55**C for 20 minutes each wash. The 
filter was allowed to air dry before being placed with 
Kodak XAR film and an intensifying screen at -70^ for 
5 two days. 

Northern Results on Transgenic Plants 

The nicked tmr probe hybridized with a mRNA 
species approximately 1.7 kb in length was observed in 

10 the total RNA and polyA* mRNA fruit lanes of the 

Northern blot. This is t'-e expected length of the 
reintroduced 2A11 gene (0.7 kb) tagged with the tmr 
gene (1.0 kb) in the antisense orientation. The level 
of expression from the reintroduced tagged gene is 

15 somewhat lower than the level of expression of the 

endogenous 2A11 gene. The level of expression of the 
reintroduced gene in immature green fruit is higher 
than the expression level in leaf tissue with a small 
amount of hybridizing mRNA in leaf tissue in these 

20 transformants. 

Example 9 
Screening Genomic Library for 
Polygalacturonase Genomic Clones 
25 Isolation of a Genomic Clone 

An EcoRI partial genomic library established 
in Charon 4 constructed from DNA of a Lycopersicon 
esculentum cultivar was screened using a probe from the 
polygalacturonase cDNA (Sheehy et al . , Mol. Gen. Genet. 
30 (1987) 2£8:30-36). A lambda clone containing an ap- 
proximately 16 kb insert was isolated from the library, 
of which an internal 2207 bp Hindlll to EcoRI was se- 
quenced. The Hindlll-EcoRI fragment includes the poly- 
galacturonse promoter region. 



35 
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Sequence of Genomic Clone 

The DNA sequence of the genomic clone was 
determined by Sanger dideoxy techniques and is as shown 
in Figure 5. The sequence of the genomic clone bases 
5 1427 to 1748 are homologous to the polygalacturonase 
cDNA sequence. 

The above results demonstrate the ability to 
identify inducible regulatory sequences in a plant 

10 genome, isolate the sequences and manipulate them. In 
this way, the production of transcription cassettes and 
expression cassettes can be produced which allow for 
differentiated cell production of the desired product. 
Thus, the phenotype of a particular plant part may be 

15 modified, without requiring that the regulated product 
be produced in all tissues, which may result in various 
adverse effects on the growth, health, and production 
capabilities of the plant. Particularly, fruit-specific 
transcription initiation capability is provided for mo- 

20 difying the phenotypic properties of a variety of fruits 
to enhance properties of interest such as processing, 
organoleptin properties, storage, yield, or the like. 



E. coli strain pCGH1299x7118 was deposited 
25 with the American Type Culture Collection (A.T.C.C.), 
12301 Parklawn Drive, Rockville, Maryland, 20852 on 
May 21, 1987 and given Accession No. 67408. 



All publications and patent applications men 
30 tioned in this specification are indicative of the 
level of skill of those skilled in the art to which 
this invention pertains. All publications and patent 
applications are herein incorporated by reference to 
the same extent as if each individual publication or 
35 patent application was specifically and individually 
indicated to be incorporated by reference. 
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Although the foregoing invention has been de- 
scribed in some detail by way .of illustration and ex- 
ample for purposes of clarity of understanding, it will 
be obvious that certain changes and modifications may 
5 be practiced within the scope of the appended claims. 
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WHAT IS CLAIMED IS; 



1. A DNA construct comprising in the direction of 
transcription, a fruit-specific transcriptional initia- 

5 tion region from a gene expressed at or immediately 
after anthesis or at the breaker stage, said gene 
remaining expressed at least until the ripe period, 
joined to a DNA sequence of interest other than the 
wild-type sequence associated with said initiation re- 
10 gion, wherein said DNA sequence of interest is under 
the transcriptional regulation of said initiation 
region, and a transcriptional termination region. 

2. A DNA construct according to Claim 1, wherein 
15 said transcriptional initiation region is frdm a gene 

expressed immediately upon anthesis. 

3. A DNA construct according to Claim 1, wherein 
said transcriptional initiation region regulates tran- 

20 scription of a gene encoding a plant storage protein. 

4. A DNA construct according to Claim 3, wherein 
said transcriptional initiation region is the 2A11 
region. 

25 

5. A DNA construct according to Claim 1, wherein 
said DNA sequence of interest is a sequence complemen- 
tary to a native plant transcript. 

30 6. A DNA construct according to Claim 1, wherein 

said DNA sequence of interest is an open reading frame 
encoding an amino acid sequence of interest. 

7. A DNA construct according to Claim 1, wherein 
35 said DNA sequence of interest is a polygalacturonase 
gene or fragment thereof of at least 12nt in the anti- 
sense direction. 
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8. A DNA construct for integration into a plant 
genome comprising at least the right T-DNA border 
joined to a DNA construct according to Claim 1. 

5 

9. A t>NA construct comprising in the direction of 
transcription, che fruit-specific transcriptional ini- 
tiation region of a plant storage protein being active 
at or immediately after an^hesis and remaining active 

10 until at least until the ripe period, joined to a DNA 
sequence other than the wild-type sequence, wherein 
said sequence comprises a unique restriction site for 
insertion of a sequence of interest to be under the 
transcriptional regulation of said initiation region, 

15 and a transcriptional termination region. 

10. A DNA construct according to Claim 9, wherein 
said transcriptional initiation region is the 2A11 
region. 

20 

11. A ONA construct for integration into a plant 
genome comprising at least the right T-DNA border 
joined to a DNA construct according to Claim 10. 

25 12. A DNA vector comprising a broad spectrum 

pcokaryotic replication system and a DNA construct 
according to Claim 1. 



13. A DNA vector comprising a broad spectrum 
30 prokaryotie replication system and a DNA construct 

according to Claim 9. 

14. A method for specifically modifying the pheno- 
type of fruit substantially distinct from other plant 

35 tissue, said method comprising: ' 

i) 
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transforming a tomato plant cell with a DNA 
construct under genomic integration conditions r wherein 
said DNA construct comprises in the direction of tran- 
scription, a 2A11 fruit-specific transcriptional initi- 
5 ation region, joined to a DNA polygalacturonase gene 
sequence/ wherein said sequence is oriented in the 
antisense direction and under the transcriptional regu- 
lation of said initiation region, and a transcriptional 
termination region, whereby said DNA construct becomes 

10 integrated into the genome of said plant cell, whereby 
said antisense sequence is transcribed and inhibits 
expression of polygalacturonase in fruit; 

regenerating a plant from said transformed 
plant cell; and 

15 growing said plant to produce fruit of the 

modified phenotype. 

15. A method according to Claim 14, wherein said 
transcription initiation region is the 2A11 region. 

20 

16. A plant cell comprising a DNA construct 
according to Claim 1. 

17. A plant cell comprising a DNA construct 
25 according to Claim 9. 

18. A method for specifically modifying the pheno- 
type of tomato fruit substantially distinct from other 
plant tissue, said method comprising: 

30 transforming a plant cell with a DNA construct 

under genomic integration conditions, wherein said DNA 
construct comprises in the direction of transcription, 
a fruit-specific transcriptional initiation region be- 
ing active at or immediately after anthesis, said gene 

35 remaining active at least until the ripe period, joined 
to a DNA sequence other than the wild-type sequence and 
capable of modifying the phenotype of fruit cells upon 
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transcriptionr wherein said sequence is under the tran- 
scriptional regulation of said initiation region^ and a 
transcriptional termination region, whereby said DMA 
construct becomes integrated into the genome of said 
5 plant cell; 

regenerating a plant from said transformed 
plant cell; and 

growing said plant to produce fruit of the 
modified phenotype. 

10 

19. A plant comprising a DNA construct according 
to Glaim 1, 

20. A plant comprising a DNA construct according 
15 to Claim 9. 

21. Fruit comprising a construct according to 
Claim 1. 

20 22. Fruit according to Claim 21, wherein said 

fruit is tomato. 

23. Fruit according to Claim 22, wherein said DNA 
sequence of interest is a polygalacturonase gene or 

25 fr%tient of at least 12nt thereof oriented in the anti- 
sense direction and said transcsription initiation 
region is 2A11. 

24. Fruit according to Claim 21, wherein said 
30 transcription initiation region is 2A11. 
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3H11 
2A11 



sail 

2AI1 



//r 



3ail Tr:tr:TGAGCAAACG<;CAACtCAGA7ATCCAAAGATC^ 

3811 ATtGTCTAACTCTTCrGAAATttAAATCrrAtCCAGAATCCtTCAXCATAAAAXAA^ 133 

3E11 ATCTATnTrrCtACtCTtCTCTAGCTTCAACrrrcrrCTTCTGCTCATCAATTA^ 20? 

TGCXCATCAATTAGCAATtAATSIiaAaL 



ACCXTTATGgCTCCCAAAAATTCAGAGATCAACTTTGCTATCTTCTTC q iT G I r c ? I TTSACGACCAC? 2 7 i 
ACCAT?ATggC7gCSAAAAATTC^ACA7GAAGTTTGCTATCTTCTTCS77C7?C?TrrCACGACCACT 
METAi4Ai*LysAjnS«rGiu>2tI.ysPhtAlAll«Phe?h*VAlValL«uL«uThrthrThr 



3ail TTAGTTCATATCTCTGGAArrrCGAAAATGCAAGTCATGGCTCTTCSAGACATACCCCCACAAGAAACA 34 5 
2 Al 1 TTAGTTGATATGTCTGGAATTtCGAAAATGCAACTGATGGCTCTTCGAGACAIACCCCCACAAGAAACA 
t«uVaiA3pKSTS«rGlytl«ScrlysM£T6inVaiMETAl«I.«uArgA3pIl«ProProGinCluTar 



TTGCTGAAAATGAACCXACTTCCCACAAATATTTTGGGACTTTGTAACGAACCTTCCACCTC^ 414 
TTGCTGAAAATGAACCTACTTCCCACAAATATTTTCGGACrrTCTAACGAACCrrGCACCTCAA^ 
L«ttI.«uLy5METLy8L«uX,«uProThrA3aIl«L«uCiyt«uCyaA3nGluProCysS«rS«rA3nS«r 

2?H 5^"5»TCGGAATTACCCTrrCCCAATTTrCtAACGAGAA6ACGGACaU^ 403 
2A11 »tTCCATCGGAAXTACCCWTCCCAArrTTGTAAC<ykCAACACCGACCACTAXGCTTTAACAtAC 

A3pCysIl«GlyXl«TlirL#ttCysGlnPh*CystysGlut.y3ThrAjpGlaTyrGlyt«uThrTyrAr7 

*«J«JACCT«TCCCCT 352 

2A11 ACAIGCAACCTGTTGCCTTGAACAAXAICAAXGATCTATCGATCGATCTATCTAICTAXTTATCTGTCT 
7hzCysAaaL«ttL«uffro • 

3H11 CTGCGCGTATAOTCTOTCTtrCACCmGCTGTGAAGAAXATGAA^ $21 
2A11 CTCCCCGTATACTGTTgTCTGTACCTTTGGTgTGAAGAAIATGAAaAAAGGGAIACAlAX tfCTAGAg 

3fllX TATTraUWTAATOTCCTATTCTATTTXAAArrTCTACW^ 690 
2A11 TATTCTACCTAAtGTCCTATTGTArTTAAAATTTCTACCAATGAtTCTTTGAAtAAAAAC^ 

3ail CTSAAAIAArrATrCCACATTAATTCACGTATTTATrTCACTtATCATACCT A T; r X T G IT CC TT t CGC 753 
lAlX GTGAAAXAATtXRCC 

38X1 GtAAAAAAAAAAAAA 774 
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ZhXX CeKCHtC 
10 
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80 



30 

TTAAAAACTA 
90 



AAAACAAATC ATCATCACAT 



ISO 
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TACTTTTTCT TAAAACTAAA 



230 



330 



ATCTGCAAAC TTTACAAATA 



290 



300 



AACACTCTCA CATAAAATAA 

3C0 370 
CTAGAACATC ATCTACACTT 

430 440 
AATAAA7TAr TTCCATATTA 

SOO 510 
TATAACTCtQ CTCAAACTAA 

970 980 
CATMTACTC ACAAC7AAAA 

€40 890 
ATTATCAATA TCCTTCAATC 

710 730 
CAATATtAAA GATAAAACAA 

780 790 
ATAACTATCA TCAACTCTAT 

890 880 
AACCATACAA AAGATCACTC 

930 930 
AOeXTACTAG AATAATACTC 



30 

TACTCAATAT 
100 

TCTACTGA7G 
170 

AAAAAATCTC 
240 

CCtAATACAA 
310 

TTATCACTAT 
380 

CAATAAAACT 
490 

TATTAACTTA 
920 

ACNACTTCTT 
990 

TATAAAATAC 
880 

CTACGTTAAT 
730 

CAATAAAAAT 
800 

ACAATATAAA 
810 

ATAAATATCA 
940 

A7TAAAAAAA 



40 

TTACMCTCAC 
110 
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180 
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290 

CATTTTCACC 
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390 

AACACCATAA 
4 80 
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800 
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740 
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990 

CCGCTTATCT 



30 
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80 
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300 
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70 
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990 1000 1010 1020 1030 1040 lOSO 

CATAATTAGT 6ACCATAAAT TTTTTTAAAA AAATCCACAT VTACACTATA ATATTTTATA ACACTTTCCC 

1080 1070 1O80 1090 1100 1110 1130 

TTAAACATCT AGGTArAAAt AA76ACTCTT GTCAAAATCT TACTA66AAA AAtTCTCTCA AATTtttTTA 

1130 1140 1190 1180 1170 1180 1190 

GtGAAAACAA ATGATATAAA TATCITCAAT ACTCATTATT TCTTCTCTCA TTAAAAATCT tATCTCACCT 

1300 1310 1330 1330 1340 1390 1380 

ATAAAATAAA TTATTTCCTC AACtCAAAAT ACTtTTTCAT TCTAAAATTA CTATAAmX TACTCAATAT 

12 1380 1390 1300 1310 1330 1330 

TTAATTAACA TAATTCTATA CTAAC66CCC 7ATAAATTGC ATTCTTCTCA AAGAAAAATA AAATCACCAC 

1340 1390 1380 1370 1393 

ACAACTTTCT TC r r crCC T C ATCAATTACC AAT7AATCCA AAACCATT ATC CCT GCC AAA AAt 

HCT AlA Alt tyt Asa 
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1<« UU 10« X4S4 

TCA CAC ATC AAC tTT CCT ATC TTC TTC CTT CIT CTT TTC ACC ACC ACT TTA CCTTCACaXc 
S«r Clw MtT lyt Ph« Al* XU Ph« Pha V«i v«l Uu Uu Thr Tht Thr Uu 

1484 XiU XS04 1514 1S24 

ACTTCTCCCT TATTXTCTTT TCTTAATTTC tlCCAACTCA TATCCAXCTC TTTCCTATCA TCCTATATAT 

1534 15«4 X374 1584 1594 

ATAAACCAAA ATATTlTTCt TAATTACTCC TXTTCTAATS TTTCCTACCT AATCCCAAAT TATTATGACA ' 
i«< 1«4 1834 1844 1854 1<€4 • 

TAATCAACTT CCAAACTCAT TATTATATAA CTTTTtTTTT ATACTTTCAT tXAACAATTC ATTTTTCTCA * 
1«4 1894 1704 1714 1724 1734* 

mtATATAA ACTTATTTTT CAACACAAAA TATTITTCCA ACTATTCAAA CACACCCTAA GACATTACAT 
. 1744 1754 1784 177< 1784 1794 1804 

ATATATATAT ATACACCCTC CCTTTTAtAT TACTTAAtCC CTATTGACTT CCCCCACCCT TTAACAATCA \ 
"14 1824 1834 1844 1854 1884 1874* 

TtCAATTACA GATATCITTT ACtAAATTAA CCtATCCTTT AACACTCTAA ATrtCCCTAT TACTATTTTA 

^^"^ 1W4 1914 1924 1934 1944 

CCTTCtAATT TAATGACAAA CATTTCATAA TCACTATACT CTCAACTTAA TTACACACAC OTATCTATAC 
i>« 1974 • 1984 1994 2004 2014* 

mCCTTACT AATCATTCAT ACCtATATAT TTCCACACCA CACACACAAA CCATATTAAC AAA«CACCA 
2034 2044 2054 2084 2074 2004 

CACACCCCAG OTAAATCTCA AATACACAAC ACAAAGCCAA CCAATTTTCA TCATCTATCA tACTtTTCAT 
2094 2104 2114 2124 2134 2144 2154 

TATTATTTTT ATTATATOtA CCTTTACATT ACACTTTTCC AATTCTTACA TTAATCTTAA TCATAATATA 

2173 2188 2203 

tACA CTT CAT ATC tCT CCA ATT TCC AAA ATC CAA CTC ATC CCT CTT CCA CAC ATA 
V*l A«p MET Uv Cly Il« Ut lyt HtT Clu V«l HtT Aim Ua kzq Atp tlm 
2233 3248 2283 

CCC CCA CAA CAA ACA TTC CTC AAA ATC AAC CTA CTT CCC ACA AAT ATT TTC CCA 
fro Pro Cln Giu Thr Uu Uu lys HtT Lys Uu Uu Pro Tht Atn ZU Uu Cly 

2278 2293 2308 

CTT TCT AAC CAA CCT TCC ACC TCA AAC TCT CAT TCC ATC CCA ATT ACC CTT TCC 
Uu Cyi Asa Clu ?to Cy* Sar Sor A«n for A«p Cyt Zlo Cly Xlo Tht L«tt Cya 

2323 3338 23S3 2380 

CAA TTT TCT AAC CAC AAC ACC CAC CAC TAT CCT TTA ACA TAC CCT ACA TCC AAC 
Clfi Ph« Cyt Lyt Clu ty« Tht Asp Cln Tyr Gly Uu Thr Tyr Ar^ Tht Cyt Atn 

2383 2393 2403 24X3 2423 3433 

CTC TTC CCT TCA ACAATATCAA TCATCTATCC ATCCATCTAT CTATCTATTT ATCTCTCTCT 
Uu Uu Pro • 

240 2453 2483 2473 2483 2493 3503 

CCCCCTATAG TCTTOTCTCT ACCTHCCTC TCAACAATAT GAATAAACCC ATJftATATAT CTACATATAT 



2513 



2523 



2533 



2543 



2553 



2583 



2573 



TCTACCTAAT CTCCTATTCT AXTTAAAATT TCTAOCAATC ATTCTTTCAA TAAAAACATA CCATCACTcV 
2583 2593 2803 2813 2823 2833 2843 

AATAATTATT CCACATTAAT TCACCTATTT ATTTCACTTA TCATACCTAT TTTTCTtCCt TTCCCCTACA* 
2853 2883 2r73 2883 2f93 2703 2713 

tTTTTCATCC TTTTCCCTTT TCAATATTAA ACATTAAACA CAAATAATGT TTATTAAATT AAOTTAATAT* 
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a72J 2733 2743 2733 27«3 2773 2783 

TTTTATTTAC CTATTTXTAT TTTTATTTCA AATCAAACTT CATAAATATT TATAAACATA ATTAACAACT 

2793 2a03 2113 3823 2S33 2843 28S3 

AATCTCACAC TAACACCATC TAATAttAtC trCTCCITAT TTATCAIAAT ATTTTAAAAT TATAATTTCA 

28«3 2873 2883 3893 2903 2913 2923 

CTTAAAAAAT TATTAAAAAA ACATACTTTT AAAAACtCAC TTACCCTCCG CTACCCACAT ACTTATCAAT 

2933 2943 2933 2983 2973 2983 2993 

TCCACTAOTT OTTtTTTCAC CCACAAAAAG AAtCCCCTAA TTAAACCTCA CCTATCAAAT TTCACAATCT 

3003 2013 3023 3033 3043 30S3 3083 

CCATACATTA CTCCOAACCA AATCACTCAC CCCCTATTCA ACAAAATATC AACAACCACO tXAtCTAAAC 

3073 3083 3093 3103 3113 3123 3133 

ATCTTTAACA ACCAAAAAAO ATnCXAAIA CATATCCACT TTCAATATCC CAACTTtOtC TCCCCATCTC 

3143 3133 3183 3173 3183 3193 3203 

AACCCTCCTT ACTTTCTTCA TCATTAACTT CTCTTCCTAT CTATTTAACA TTTAAACrTT ATATCTTTAA 

3213 3223 3233 3243 3233 3283 3273 

ACTTACACAA AATACATATA AATCTCTCAA CACTTCCCAA CA7AAITTAC TTTACTACTT AAACTACATC 

3283 3293 3303 3313 3323 3333 3343 

AAAATtTAAA lATCCtTTTA ACATCTTTCA ACTCAATTAA ATTATCACAA TCCCACCCTA CACCTtCCAC 

3353 3383 3373 3383 3393 3403 3413 

CTCCCCCCCA CTCAACAACC ACTCCTCCtC CCCAACCTAA CCCTCATCCT CACTCACTAC AACCCCAACG 

3423 3433 3443 3433 3483 3473 3483 

CTAACTTAAC TATACAAAAC CTTAAAACTC AATAAAATAA ACTTTACAAG CtTTTAACAC AAATCAACAA 

3493 3303 3313 3323 3533 3543 3553 

CTTTCAACAA AATAATATAT TCAACTACCC ATAAAATAGA CAACTTTA6T CTTTAAAACA TTTAATAAAA 

3583 3573 3383 3593 3803 3813 3823 

tAAAtCCAAA ATATACACTC CtTAACTAAA CTGACTATCT AIGCACCCTC TAATtGATAA A6ATGCAACT 

3833 3843 3833 3883 3873 3883 3893 

CCCCACAACA CCACCACATC CtCACTAAAC TCACAACTAA ATAAAATCCC CCCGAAAAAA ACCA6 C CTCA 

3703 3713 3723 3733 3743 3753 3783 

CCATGCCTXA CTCCAACTCC CC6ATATATC AAtCAACCTC CTCTTGAtCA TCTT6AACAC ATCTCTCTGC 

3773 3783 3793 3803 3813 3823 3833 

ATCATCAAAA AOATGCAGCC CAAAt CCC t C ACTACCTAAA AIGTACGAOt ATOTAAOCOA AAtTCTAAAO 

3843 3853 3883 3873 3883 3893 3903 

tATAACATAA CCnOATACT TGAATAAAAC CAAACATAC7 TACCtCmt CAACTCAACt CAAATTAAOA 

3913 3923 3933 3943 3953 3983 3973 

ATAACA^ACt CAACtCAAAC AXTACCTATr CAACOCAAAT ATCCCACTCT ACTCAATCAA CTACAAATTA 

3983 3993 4003 4013 . 4033 4033 4043 

ACTCACCATA CTCGACTTAA 6ATACTCAAC TCCCGACACT CAACTCAACT CATTTCAATA TAAACCACCt 

4033 4083 4073 4083 4093 41 03 4113 

TAAAACAACT TCACTATAAA CTAAACTTCT TTAAAAACAT CATCTCAACT CTGTCTCTAT AATAACCCAt 
4123 4133 4143 4133 4183 4173 4183 

ACAACATAAC tXTGAAATCT ATATAAAAAT ACAATTAACT CATCTATATA AAAATACATt AATCTATGCG 

4193 4203 4213 4223 4233 4243 4253 

ACATtCTCTA ACCOACAACC ATCACtTAAO CCCTAAOAta ATGAtATACC GATCTACCGC A06CTCCCAT 
4283 4213 4283 4293 4303 4313 4323 

CGCAtCItAT ACCCCCCCAA ACCtATAACA CCTCAACICC CTAATCAATC CACTAATAAA CiattAAAAO 
4333 4343 4353 4383 4373 4383 4393 

OAATCATCTA AAAAOTATOA CCCTTTTCTA CCCATACTGG CTAACATCCT TTATCCCCCC TCTCACTTAT 
4403 4413 4423 4433 4443 4453 4483 

CTCAACTCTC CCCCATATCG 6TCCTCAATA CTACTCCAAA AAATATACTG CTCTTATGTt TAAAAACATA 
4473 4483 4493 4303 4313 4523 4533 

CT6ATTCTCT CCrrTCAAAT TATtCCTTAA AGCttAGATT TTTCAAAACC TCtCTTTTGA AAATCOTACT 
4543 4553 4583 4573 4583 4393 4803 

TTCCTTTTTC TTCTATTAAA CCtAGACATA C6CTATCTAG AACtCTACCt TACCTTCCTT CTCAAAACTT 

4813 4823 4833 4843 4853 

TCAAAACATT T6CTTACATT CTTA6CCACT ACTTACTTCC CTTGTTCCAA TTC 
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10 20 30 40 30 «0 ^0 

AACCTTCTtA AAAACSCAAA TtSAttAAIT tCAACTCAAA AtAATTAAT? ATAACACTOC lAAACCACCr 

ac 90 100 no lao 130 140 

TAACAAACCA TACTTTCAAA WtTACCAAr CCSCTAIATA tTAATCAACT tSATAATATA AAAAAAATTT 

lao X60 X70 180 190 200 210 

CAAt^eeAAA ACCCCCTAAA ArAtTCTCAA ACtAttCCAA ATCCTACAAA ACTACCATCC CtCCACCTAT 

220 230 240 330 2«0 270 2fl0 

TCACTCCAAA ATAAAATtAT TATCCACCTt TCACTTTAAA ATTCACtACT TATATAACAA TTCTAAATtt 

290 300 310 330 330 340 390 

AAACtATTrr AATACTTTTA AAAATACATC CCSTtCAAAX AtTtAATATA AXTtAATTTA TCAATAICAT 

3<0 370 380 390 400 410 420 

TTATAAACCA ACCAACTACC AACtCATTAA TCAtTAAATC CCACCCAAAT TCTACtATCA AAATTCTCCT 

430 440 430 460 470 490 490 

AAACACTACT AAAACAAGAC CAAATTCTTC CACTCCCAAT C5AACCACCA ATCTAATTtA CCTTGACCCC 

300 310 320 330 340 330 360 

CATATtTACG ACCACACTXT CAATACTATT TtTTTCAACC ATCAAtXTCA AAtlTAACAT tAATCCTAAA 

S70 380 390 €00 610 620 630 

CAAOTAGTAC ATCCCCAATt AATTCATGCC TTTTTTAAAT ATAAtTATA? AAATATTXAT CATTTGTTTT 

640 630 W 670 680 690 700 

AAATXnAAA ACTTOAATAT ATTAttTtTt TAAAAATTAT CTATTAACTA CCATCACATA ATTGACACCA 

110 720 730 740 750 760 770 

ACCAATAATt AACAtOAACA TACTCTTTAA TTACtAAtCfi A?6GCtACTA AAtTTATTTA TAAAXTATAT 

7t0 790 too 810 820 830 8<0 

eAATAACTTA AATTATAACA AATATTTCAG CCCCAICTAt VtTAAAAAAT ATTAAATAGT rTGAATTtAA 

850 880 870 880 890 900 910 

AACeetTACA TAAAtCatCA ATtTT C AACe CAAAACTCCA tOACAACCGT ATTrTACACC CAATAGCCC G 

920 930 940 950 960 970 980 

ATOACAACCA TATTttCAAC CCAAIATCtC AT C CAtCAAa GATAATIXrti tATCATTSCT AAtACtTTAA 

990 1000 1010 1020 1030 1040 1050 

ACATATTRA CCTCXtTTTC CCTtCttTAC TTTAtACACT AtAOTCTTAG rrCATCCAAf ATCATCTAT? 

1080 1070 1080 1090 1100 UIO 1120 

ATTtCCCTCT TAAACTATTT TTTATTTTAT AAATtrTTlA AAAATAAAT? ATTTTTtCCA TTTAACTTTG 

lUO 1140 1150 1180 1170 UM 1190 

ATTOTAAtTA AKRTAAAA ATTAeCAACA TAtAAATAAA ATTAAtAnt AACAAACAAT TCTAACAtAA 

1200 laifl 1220 1230 1240 1250 1260 

TAmrtTTA ATTATTCAAA AtAAAtAtTT TtAAACATCA TATAAAACAA ATAC6ACAAA AAAAtTGAGA 

1270 1280 1290 1300 UIO 1320 1J30 

CC6CACAACA CAAGCCACAC AAAAATCTCC AAGAAACTCt TTCCTCtAAA TATCTCTCAT CCAAACTAAT 

1340 1350 1360 1370 1380 1390 1400 

ATAATACCCA TtATAATTAA CCATATTGAC CAACTCAAAC CCCTTAAAAT CTATAAATAG ACAAACCtftt 

1410 1420 1430 1440 1450 1460 14^0 

CCCATACCtC TTATCATAAA AAAAA?AA7A A?C?T???CA AtACACAAGt tXAAAAACtA 7ACCATA7AA 
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X480 1490 1900 1310 1320 1230 1540 

CAATAtATCA ?«TTAICCA AACCAATAt;! ArrCTCCTTC TCATTArtAT rrrtS«:CA TCAATTTCAA 

1950 1560 1570 1560 1590 1600 1610 

CTTCTACAAC CAATCTTATt CAtSACAAtT TATTCAAACA ACTrtATCAT AAtATtcrTG AACAAGAAtt 

1620 1630 . 1640 1150 1660 1670 1660 

TCCTCATCAT TtlCAACCTT ATCTTTCtlA tttCACCAAA AATATTCAAA CCAACAATAA TATTCACAAC 

1690 1700 1710 1720 1730 1740 1750 

GtTGATAAAA AtCSCAXTAA ACTCATTAAr CtACTtACCT TTGCACCTAA CCCTCATCCA AAAACATAtS 

1760 1770 1780 1790 1800 1810 1820 

ATAATAXTCT AACTATTTAA ATAT7SGAAX AXATTTCTCG CCATCAAAAT GAtACACAAT ATAACAATTA 

1830 1840 1850 1860 1970 1880 1990 

trrccAAccA tcaaaactta tattttataa agtagaaaa: iattttctcg rrntACTAA ttaaa©ctca 

1900 1910 1920 1930 1940 1950 1960 

AAAATCAGTT tlCTCGtAAG C6ACCAAACT CATTTTCCAT CCAACT6TAT TTTTttTrXA CTTTTAAtAA 

1970 1980 1990 2000 2010 2020 3030 

CCTCATACTA TTT6CTATAC TCAACAATAA CACACTATTA TTGATSTTTA GT6CTCCAAA AGAAAXTCAT 

3040 2050 2060 2070 2080 2090 2100 

ACTAAXTXXC CXAAXAXAAC XAXCAAXXXC XXAXAXOXAt AXXXXTCAAC CAAAAXAACA AACCCXAATC 

2110 2120 2130 2140 2150 2160 2170 

CAAXAAOXOa OCCXCXACAA XAAAOACXAA CXXCtAXXAA XTCXXAACCX XAXXTAATX? XATCCAAACC 

2180 219P 3300 

TCSACAAAAC 0ACAAX6CXC AACtTATAXT CCAAftC 
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